Tianyi Zhou

VideoHallu: Evaluating and Mitigating Multi-modal Hallucinations for Synthetic Videos

May 02, 2025
Via arXiv

Federated Adapter on Foundation Models: An Out-Of-Distribution Approach

May 02, 2025
Via arXiv

Skill Discovery for Software Scripting Automation via Offline Simulations with LLMs

Apr 29, 2025
Via arXiv

WALL-E 2.0: World Alignment by NeuroSymbolic Learning improves World Model-based LLM Agents

Apr 22, 2025
Via arXiv

Exploring Expert Failures Improves LLM Agent Tuning

Apr 18, 2025
Via arXiv

GraphicBench: A Planning Benchmark for Graphic Design with Language Agents

Apr 15, 2025
Via arXiv

How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients

Apr 14, 2025
Via arXiv

C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing

Apr 10, 2025
Via arXiv

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

Apr 10, 2025
Via arXiv

Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?

Apr 09, 2025
Via arXiv